Abstract. We prove that $\log \operatorname{lcm}\{a \in A\} = n \log 2 + o(n)$ for almost every set $A \subset \{1, \dots, n\}$.
We also study the typical behavior of the logarithm of the least common multiple of sets of
integers in $\{1, \dots, n\}$ with prescribed size. For example, we prove that, for any $0 < \theta < 1$,
$\log \operatorname{lcm}\{a \in A\} = (1 - \theta) n^{\theta} \log n + o(n^{\theta})$ for almost all sets $A \subset \{1, \dots, n\}$ of size $\lfloor n^{\theta} \rfloor$.
Extremal values of $\log \operatorname{lcm}\{a \in A\}$ for sets $A$ of prescribed size are also studied.



1. Introduction 

The function $\psi(n) = \log \operatorname{lcm}\{k : 1 \le k \le n\}$ was introduced by Chebyshev in his study
of the distribution of the prime numbers. It is a well known fact that the asymptotic
relation $\psi(n) \sim n$ is equivalent to the Prime Number Theorem, which was finally proved by
Hadamard and de la Vallée Poussin. Chebyshev's function can be generalized to
$\psi_f(n) = \log \operatorname{lcm}\{f(k) : 1 \le f(k) \le n\}$ for a given polynomial $f(x) \in \mathbb{Z}[x]$ and it is natural to try to
obtain the asymptotic behavior for $\psi_f(n)$. Some progress has been made in this direction.
In [1], the Prime Number Theorem for arithmetic progressions is exploited to get the asymptotic
estimate when $f(x) = a_1 x + a_0$ is a linear polynomial:

\[ \psi_f(n) \sim \frac{n}{\varphi(q)} \sum_{\substack{1 \le l \le q \\ (l,q)=1}} \frac{1}{l}, \]

where $q = a_1/(a_1, a_0)$. The first author [3] has extended this result to quadratic polynomials.
For a given irreducible quadratic polynomial $f(x) = a_2 x^2 + a_1 x + a_0 \in \mathbb{Z}[x]$ with $a_2 > 0$ the
following asymptotic estimate holds:

\[ \psi_f(n) = \frac{1}{2} \left(\frac{n}{a_2}\right)^{1/2} \log\left(\frac{n}{a_2}\right) + B \left(\frac{n}{a_2}\right)^{1/2} + o(n^{1/2}), \tag{1} \]

where the constant $B = B(f)$ depends only on $f$. In the particular case of $f(x) = x^2 + 1$, we
have $\psi_f(n) = n^{1/2} \log n^{1/2} + B n^{1/2} + o(n^{1/2})$ with
$B = \gamma - 1 - \frac{\log 2}{2} - \sum_{p \ne 2} \left(\frac{-1}{p}\right) \frac{\log p}{p-1}$, where
$\gamma$ is the Euler constant, $\left(\frac{-1}{p}\right)$ is the Legendre symbol and the sum is considered over all odd
prime numbers. It has recently been proved in [4] that the error term in the previous expression
is $O\left(n^{1/2} (\log n)^{-4/9+\varepsilon}\right)$ for each $\varepsilon > 0$. When $f(x)$ is a reducible polynomial the behavior is,
however, different. We have (see Theorem 3 in [3]):

\[ \psi_f(n) \sim C n^{1/2}, \tag{2} \]

where $C$ is an explicit constant depending only on $f$. For example, when $f(x) = x^2 - 1$ we have
$\psi_f(n) \sim n^{1/2}$.

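The asymptotic behavior of $\psi_f(n)$ remains unknown for irreducible polynomials of degree
$d \ge 3$, but it is conjectured in [3] that this is given by

\[ \psi_f(n) \sim \left(1 - \frac{1}{d}\right) \left(\frac{n}{a_d}\right)^{1/d} \log\left(\frac{n}{a_d}\right), \tag{3} \]

where $a_d > 0$ is the coefficient of $x^d$. For example, this conjecture would imply
$\psi_f(n) \sim \frac{2}{3} n^{1/3} \log n$ for $f(x) = x^3 + 2$.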

The above discussed results are particular cases of a more general problem, which we study
in this work. For any set of positive integers $A$ let us set

\[ \psi(A) = \log \operatorname{lcm}\{a : a \in A\}. \]

Let us denote $A_f = \{f(k) : k \in \mathbb{Z}\}$ and $A_f(n) = A_f \cap \{1, \dots, n\}$. We observe that
$\psi_f(n) = \psi(A_f(n))$. Consider the two quadratic polynomials $f(x) = x^2 + 1$ and $f(x) = x^2 - 1$. In
both cases we have $|A_f(n)| = n^{1/2} + O(1)$, but $\psi(A_f(n)) \sim \frac{1}{2} n^{1/2} \log n$ in the first case and
$\psi(A_f(n)) \sim n^{1/2}$ in the second one.
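As a rough numerical illustration of this contrast, the following Python sketch computes $\psi(A_f(n))$ exactly for both polynomials. The helper names `psi` and `values_up_to` and the choice $n = 10^4$ are illustrative assumptions, not part of the paper, and only the positive values $f(k)$ with $k \ge 1$ are collected.

```python
from math import gcd, log

def psi(A):
    """psi(A) = log lcm{a : a in A}; the empty set gives log 1 = 0."""
    l = 1
    for a in A:
        l = l * a // gcd(l, a)
    return log(l)

def values_up_to(f, n):
    """Values f(k), k = 1, 2, ..., with 1 <= f(k) <= n (f is assumed increasing for k >= 1)."""
    values, k = [], 1
    while f(k) <= n:
        if f(k) >= 1:
            values.append(f(k))
        k += 1
    return values

n = 10**4
irreducible = values_up_to(lambda x: x * x + 1, n)  # values of x^2 + 1
reducible = values_up_to(lambda x: x * x - 1, n)    # values of x^2 - 1 = (x - 1)(x + 1)

# Both sets have about n^(1/2) elements, but their lcm's have very different sizes.
print(len(irreducible), psi(irreducible))
print(len(reducible), psi(reducible))
```

At this small scale the lower order terms are still visible, so the printed values should be read as an illustration of the two growth rates rather than as a check of the asymptotic constants.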

It is natural to ask if either of these compare well with the behavior of $\psi(A)$ for a random
subset of $\{1, \dots, n\}$ of cardinality $\lfloor n^{1/2} \rfloor$ or if both of these examples represent rare events. The
main purpose of the present paper is to study these questions.

Let us denote by $\binom{[n]}{k}$ the set of all subsets $A \subset \{1, \dots, n\}$ of size $k$. The mean value of $\psi$ in
$\binom{[n]}{k}$ is the quantity

\[ \overline{\psi}(n,k) = \binom{n}{k}^{-1} \sum_{A \in \binom{[n]}{k}} \psi(A). \]

It is natural to wonder whether for a given polynomial $f$ and $k = |A_f(n)|$ the asymptotic
$\overline{\psi}(n,k) \sim \psi_f(n)$ holds when $n \to \infty$. If this is indeed the case, then we may also ask if
$\psi(A) \sim \overline{\psi}(n,k)$ for almost all sets $A \in \binom{[n]}{k}$. Our main theorem answers these questions.



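Theorem 1.1. Let $c > 0$, $0 < \theta \le 1$ and $k = c n^{\theta} + O(1)$. We then have that

\[ \overline{\psi}(n,k) = \begin{cases} c(1-\theta) n^{\theta} \log n - c(\log c) n^{\theta} + o(n^{\theta}), & 0 < \theta < 1, \\[4pt] \dfrac{c \log(1/c)}{1-c}\, n + o(n), & \theta = 1, \end{cases} \]

when $n \to \infty$. Furthermore, for any $\varepsilon > 0$ we have that

\[ \frac{\left|\left\{A \in \binom{[n]}{k} : |\psi(A) - \overline{\psi}(n,k)| < \varepsilon\, \overline{\psi}(n,k)\right\}\right|}{\binom{n}{k}} \to 1 \]

when $n \to \infty$.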



In particular, for $0 < \theta < 1$, our main theorem implies that $\psi(A) \sim c(1-\theta) n^{\theta} \log n$ for almost
every set $A \subset \{1, \dots, n\}$ of size $\lfloor c n^{\theta} \rfloor$.
We observe that for $f(x) = a_d x^d + \cdots + a_0$ we have $|A_f(n)| = (n/a_d)^{1/d} + O(1)$. Thus, if $f$
is irreducible of degree $d \ge 2$ and we assume conjecture (3) (known to be true for $d = 2$), then
Theorem 1.1 implies that $\psi_f(n) \sim \overline{\psi}(n,k)$ when $k = |A_f(n)| = c n^{\theta} + O(1)$ with $\theta = 1/d$ and
$c = a_d^{-1/d}$. However, although the asymptotic behavior of $\psi(A_f(n))$ is like that of a typical set $A$
of the same size, there are some differences in the second term. For example, if $f(x) = x^2 + 1$,
we have

\[ \psi_f(n) = \tfrac{1}{2} n^{1/2} \log n + B n^{1/2} + o(n^{1/2}), \]

\[ \overline{\psi}(n,k) = \tfrac{1}{2} n^{1/2} \log n + o(n^{1/2}), \]

for $k = |A_f(n)| = \lfloor \sqrt{n-1} \rfloor$. The constant above is $B = -0.06627563\ldots$

For $d = 1$, namely $f(x) = a_1 x + a_0$, $(a_1, a_0) = 1$, the situation is different. In this case, for
$k = |A_f(n)|$ it is easily seen that $\overline{\psi}(n,k)$ and $\psi_f(n)$ have different asymptotic behavior when
$a_1 > 1$. More precisely, we have that

\[ \overline{\psi}(n,k) \sim \frac{\log a_1}{a_1 - 1}\, n, \qquad \psi_f(n) \sim \frac{n}{\varphi(a_1)} \sum_{\substack{1 \le l \le a_1 \\ (l,a_1)=1}} \frac{1}{l}. \]
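The two coefficients of $n$ in the linear case are elementary to tabulate. The Python sketch below evaluates $\log a_1/(a_1 - 1)$ and $\varphi(a_1)^{-1} \sum_{(l,a_1)=1} 1/l$ for a few moduli; the function names are illustrative assumptions and the code only evaluates the two closed expressions, it does not compute any least common multiple.

```python
from math import gcd, log

def typical_constant(a1):
    """Coefficient of n for a typical set of size about n / a1: log(a1) / (a1 - 1)."""
    return log(a1) / (a1 - 1)

def progression_constant(a1):
    """Coefficient of n for f(x) = a1*x + a0 with (a1, a0) = 1:
    (1 / phi(a1)) * sum of 1/l over 1 <= l <= a1 with gcd(l, a1) = 1."""
    units = [l for l in range(1, a1 + 1) if gcd(l, a1) == 1]
    return sum(1.0 / l for l in units) / len(units)

for a1 in range(2, 8):
    print(a1, round(typical_constant(a1), 4), round(progression_constant(a1), 4))
```

For each modulus in this small range the progression constant is the larger of the two, which is the discrepancy described above.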

While we have dealt above with sets of prescribed size, it is also a natural question to ask for
the mean value of $\psi(A)$ over all subsets $A \subset \{1, \dots, n\}$ (define $\psi(\emptyset) = 0$). In other words, we
may ask for the asymptotic behavior of

\[ \overline{\psi}_n = 2^{-n} \sum_{A \subset \{1, \dots, n\}} \psi(A). \]

We answer this question by considering a more general problem. For a given $\delta$, $0 < \delta < 1$,
select the elements of $A$ as outputs of the independent events $a \in A$, $1 \le a \le n$, with the same
probability $P(a \in A) = \delta$. We denote in the sequel this probability space by $S(n;\delta)$. That
is, $S(n;\delta)$ is the set of subsets of $\{1, \dots, n\}$ equipped with the probability measure given by
$P(X) = \delta^{|X|} (1-\delta)^{n-|X|}$ for any subset $X$ of $\{1, \dots, n\}$. Observe that $\overline{\psi}_n$ is just the expected
value of $\psi(A)$ in the probability space $S(n;1/2)$. In Proposition 2.1 we obtain an explicit
expression for $E(\psi(A))$ in the probability space $S(n;\delta)$. This proposition immediately implies
the following theorem which proves, in particular, that

\[ \overline{\psi}_n = n \log 2 + o(n). \]

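Theorem 1.2. Let $c > 0$ and $0 < \theta \le 1$. In the probability space $S(n; c n^{\theta-1})$ the expected value
of $\psi(A) = \log \operatorname{lcm}\{a : a \in A\}$ satisfies

\[ E(\psi(A)) = \begin{cases} c(1-\theta) n^{\theta} \log n - c(\log c) n^{\theta} + o(n^{\theta}), & 0 < \theta < 1, \\[4pt] \dfrac{c \log(1/c)}{1-c}\, n + o(n), & \theta = 1. \end{cases} \]

Furthermore, the variance satisfies $V(\psi(A)) \ll n^{\theta} \log^2 n$.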
Clearly, $E(|A|) = c n^{\theta}$ and even more, $|A| \sim c n^{\theta}$ for almost every set $A$, so $S(n; c n^{\theta-1})$ is
the appropriate probability space to simulate sets $A$ of this size. We observe that the estimate
for $E(\psi(A))$ in Theorem 1.2 is similar to that obtained in Theorem 1.1 for $\overline{\psi}(n,k)$ when
$k = c n^{\theta} + O(1)$.

Note that the upper bound for the variance implies concentration. Indeed, for any $\varepsilon > 0$ we
have that $P(|\psi(A) - E(\psi(A))| < \varepsilon E(\psi(A))) \to 1$ as $n \to \infty$. In other words, for almost all sets $A$
in the probability space $S(n; c n^{\theta-1})$ and $0 < \theta < 1$ (respectively $\theta = 1$) the asymptotic estimates
$|A| \sim c n^{\theta}$ and $\psi(A) \sim c(1-\theta) n^{\theta} \log n$ (respectively $\psi(A) \sim \frac{c \log(1/c)}{1-c}\, n$) hold.
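The concentration statement can be observed in a small simulation. The sketch below draws a few sets from $S(n;\delta)$ with $\theta = 1$ and $c = 1/2$ and compares the sample means of $|A|$ and $\psi(A)$ with $cn$ and with the main term $\frac{c \log(1/c)}{1-c}\, n$. The parameters $n = 2000$, the 20 samples and the fixed seed are arbitrary assumptions made for reproducibility, and $\psi(A)$ is evaluated by summing $\log p$ over the prime powers $p^s$ that divide some element of $A$.

```python
import random
from math import log

def prime_power_logs(n):
    """Return {m: log p} for the prime powers m = p^s <= n (simple sieve)."""
    is_prime = [True] * (n + 1)
    table = {}
    for p in range(2, n + 1):
        if is_prime[p]:
            for q in range(p * p, n + 1, p):
                is_prime[q] = False
            m = p
            while m <= n:
                table[m] = log(p)
                m *= p
    return table

def sample_psi(n, delta, rng, table):
    """Draw A from S(n; delta) and return the pair (|A|, psi(A))."""
    # member[a] records whether a belongs to A; index 0 is unused.
    member = [False] + [rng.random() < delta for _ in range(n)]
    value = sum(lam for m, lam in table.items()
                if any(member[a] for a in range(m, n + 1, m)))
    return sum(member), value

n, c, theta = 2000, 0.5, 1.0
delta = c * n ** (theta - 1)          # here delta = 1/2
rng = random.Random(0)                # fixed seed, only for reproducibility
table = prime_power_logs(n)
samples = [sample_psi(n, delta, rng, table) for _ in range(20)]
mean_size = sum(size for size, _ in samples) / len(samples)
mean_psi = sum(value for _, value in samples) / len(samples)
prediction = c * log(1 / c) / (1 - c) * n   # main term for theta = 1
print(mean_size, mean_psi, prediction)
```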

Our reason for considering $S(n;\delta)$ is not only because it is an interesting and natural
probability space but also because it is close to the probability space considered in Theorem 1.1
where all the sets of size $\delta n$ are chosen with the same probability. It appears to be difficult to
prove Theorem 1.1 directly. For this reason, our strategy will be to prove Theorem 1.2 first and
then deduce Theorem 1.1 from it.

The above discussion tells us what to expect from $\psi(A)$ in this setting, at least in most
cases. As we have seen, sets $A_f$ are exceptional cases and their difference from the expected
value depends on the irreducibility of $f$. What we still do not know is how far from $E(\psi(A))$
the exceptional cases are. We finish our work studying the extremal behavior of $\psi(A)$ for sets
of prescribed size.

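Theorem 1.3. Let $c > 0$ and $0 < \theta < 1$. We then have that

i) \[ \max_{\substack{A \subset \{1, \dots, n\} \\ |A| = \lfloor c n^{\theta} \rfloor}} \psi(A) \ge c n^{\theta} (\log n)(1 + o(1)). \]

ii) \[ \min_{\substack{A \subset \{1, \dots, n\} \\ |A| = \lfloor c n^{\theta} \rfloor}} \psi(A) \le (\log n)^{2 + \frac{\theta}{1-\theta} + o(1)}. \]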

2. Chebyshev's function for random sets in $S(n;\delta)$ and proof of Theorem 1.2

When $A$ is a set of positive integers, we consider the quantity $\psi(A) = \log \operatorname{lcm}\{a : a \in A\}$.
The following lemma provides us with an explicit expression for $\psi(A)$.

Lemma 2.1. For any set of positive integers $A$ we have $\psi(A) = \sum_m \Lambda(m) I_A(m)$, where $\Lambda$
denotes the classical von Mangoldt function and

\[ I_A(m) = \begin{cases} 1 & \text{if } A \cap \{m, 2m, 3m, \dots\} \ne \emptyset, \\ 0 & \text{otherwise.} \end{cases} \]

Proof. Observe that $p^k \mid \operatorname{lcm}\{a : a \in A\}$ if and only if
$A \cap \{p^s, 2p^s, 3p^s, \dots\} \ne \emptyset$ for $s = 1, \dots, k$. □

Note that if $A = \{1, \dots, n\}$ then $\psi(A) = \sum_{m \le n} \Lambda(m)$ is the classical Chebyshev function
$\psi(n)$.
We recall that $S(n;\delta)$ is the probability space with sets $A \subset \{1, \dots, n\}$ such that all its events
$\{a \in A\}$ are independent and $P(a \in A) = \delta$ for any $1 \le a \le n$.
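The identity of Lemma 2.1 is easy to test on small sets. The following Python sketch computes $\psi(A)$ once directly from the least common multiple and once as the sum of $\Lambda(m)$ over those $m$ that divide at least one element of $A$. The function names are illustrative assumptions and the trial-division `von_mangoldt` is only meant for small inputs.

```python
from math import gcd, log

def lcm_of(A):
    """Least common multiple of the integers in A (1 for the empty set)."""
    result = 1
    for a in A:
        result = result * a // gcd(result, a)
    return result

def von_mangoldt(m):
    """Lambda(m): log p if m is a power of a prime p, and 0 otherwise."""
    if m < 2:
        return 0.0
    p = 2
    while p * p <= m:
        if m % p == 0:
            while m % p == 0:
                m //= p
            return log(p) if m == 1 else 0.0
        p += 1
    return log(m)  # no divisor up to sqrt(m), so m is prime

def psi_direct(A):
    """psi(A) computed as log lcm{a : a in A}."""
    return log(lcm_of(A))

def psi_lemma(A):
    """psi(A) computed as the sum of Lambda(m) over m with a multiple in A."""
    A = set(A)
    if not A:
        return 0.0
    return sum(von_mangoldt(m) for m in range(2, max(A) + 1)
               if any(a % m == 0 for a in A))

A = [4, 6, 9, 10]
print(lcm_of(A), psi_direct(A), psi_lemma(A))
```

For this set the prime powers with a multiple in $A$ are $2, 4, 3, 9, 5$, so both evaluations return $\log(4 \cdot 9 \cdot 5)$ up to rounding.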

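2.1. Expectation. First of all we give an explicit expression for the expected value of $\psi(A)$ in
$S(n;\delta)$.

Proposition 2.1. Let $\delta$ in $(0, 1)$ be fixed. Then in $S(n;\delta)$ we have

\[ E(\psi(A)) = n\, \frac{\delta \log(\delta^{-1})}{1-\delta} \left( 1 + \frac{1}{\log(\delta^{-1})} \sum_{r \ge 1} \varepsilon\!\left(\frac{n}{r}\right) \frac{(1-\delta)^r}{r} \right), \]

where $\varepsilon(x) = \frac{\psi(x)}{x} - 1$ denotes the error term in the Prime Number Theorem.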

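Proof. By linearity, Lemma 2.1 clearly implies $E(\psi(A)) = \sum_{m \le n} \Lambda(m) E(I_A(m))$. On the other
hand,

\[ E(I_A(m)) = P(A \cap \{m, 2m, \dots\} \ne \emptyset) = 1 - \prod_{r \le n/m} P(rm \notin A) = 1 - (1-\delta)^{\lfloor n/m \rfloor}. \tag{4} \]

Thus,

\[ E(\psi(A)) = \sum_{m \le n} \Lambda(m) \left( 1 - (1-\delta)^{\lfloor n/m \rfloor} \right). \]

We observe that $\lfloor n/m \rfloor = r$ whenever $\frac{n}{r+1} < m \le \frac{n}{r}$, so we split the sum into intervals
$J_r = \left( \frac{n}{r+1}, \frac{n}{r} \right]$, obtaining

\[ \begin{aligned}
E(\psi(A)) &= \sum_{r \ge 1} \left(1 - (1-\delta)^r\right) \sum_{m \in J_r} \Lambda(m) \\
&= \sum_{r \ge 1} \left(1 - (1-\delta)^r\right) \left( \psi\!\left(\frac{n}{r}\right) - \psi\!\left(\frac{n}{r+1}\right) \right) \\
&= \delta \sum_{r \ge 1} \psi\!\left(\frac{n}{r}\right) (1-\delta)^{r-1} \\
&= \delta n \sum_{r \ge 1} \left(1 + \varepsilon\!\left(\frac{n}{r}\right)\right) \frac{(1-\delta)^{r-1}}{r} \\
&= \delta n \sum_{r \ge 1} \frac{(1-\delta)^{r-1}}{r} + \delta n \sum_{r \ge 1} \varepsilon\!\left(\frac{n}{r}\right) \frac{(1-\delta)^{r-1}}{r} \\
&= n\, \frac{\delta \log(\delta^{-1})}{1-\delta} + \frac{n \delta}{1-\delta} \sum_{r \ge 1} \varepsilon\!\left(\frac{n}{r}\right) \frac{(1-\delta)^r}{r}.
\end{aligned} \]

□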
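The exact formula $E(\psi(A)) = \sum_{m \le n} \Lambda(m)(1 - (1-\delta)^{\lfloor n/m \rfloor})$ obtained in the proof can be evaluated directly. The Python sketch below does so, compares it with a brute-force average over all subsets for a very small $n$, and prints the ratio $E(\psi(A))/n$ for $\delta = 1/2$ next to $\log 2$. All function names and the sample values of $n$ are illustrative assumptions.

```python
from itertools import combinations
from math import gcd, log

def prime_power_logs(n):
    """Return {m: Lambda(m)} for the prime powers m <= n (simple sieve)."""
    is_prime = [True] * (n + 1)
    table = {}
    for p in range(2, n + 1):
        if is_prime[p]:
            for q in range(p * p, n + 1, p):
                is_prime[q] = False
            m = p
            while m <= n:
                table[m] = log(p)
                m *= p
    return table

def expected_psi(n, delta):
    """E(psi(A)) in S(n; delta): sum of Lambda(m) * (1 - (1 - delta)^floor(n/m))."""
    return sum(lam * (1.0 - (1.0 - delta) ** (n // m))
               for m, lam in prime_power_logs(n).items())

def main_term(n, delta):
    """Main term n * delta * log(1/delta) / (1 - delta) of Proposition 2.1."""
    return n * delta * log(1.0 / delta) / (1.0 - delta)

def brute_force_expectation(n, delta):
    """Average of psi(A) over all subsets of {1, ..., n}, weighted as in S(n; delta)."""
    total = 0.0
    for size in range(n + 1):
        weight = delta ** size * (1.0 - delta) ** (n - size)
        for A in combinations(range(1, n + 1), size):
            l = 1
            for a in A:
                l = l * a // gcd(l, a)
            total += weight * log(l)
    return total

print(expected_psi(8, 0.3), brute_force_expectation(8, 0.3))
for n in (100, 1000, 10000):
    print(n, expected_psi(n, 0.5) / n, log(2))
```

The brute-force routine enumerates all $2^n$ subsets and is only usable for very small $n$; it is included purely as an independent check of the closed formula.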

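Corollary 2.1. If $\delta n \to \infty$ then

\[ E(\psi(A)) = n\, \frac{\delta \log(\delta^{-1})}{1-\delta} \left( 1 + O\!\left( e^{-C \sqrt{\log(\delta n)}} \right) \right) \]

in $S(n;\delta)$ when $n \to \infty$.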
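Proof. We get bounds for the error term, taking into account the error term in the Prime Number
Theorem. It is known that $\varepsilon(x) = O(e^{-C_1 \sqrt{\log x}})$ for some $C_1 > 0$. For small values of $x$ we use
that $\varepsilon(x) = O(1)$. For $T = \frac{\sqrt{\log(\delta n)}}{\delta}$ we have

\[ \begin{aligned}
\sum_{r \ge 1} \varepsilon\!\left(\frac{n}{r}\right) \frac{(1-\delta)^r}{r}
&= \sum_{r < T} \varepsilon\!\left(\frac{n}{r}\right) \frac{(1-\delta)^r}{r} + \sum_{r \ge T} \varepsilon\!\left(\frac{n}{r}\right) \frac{(1-\delta)^r}{r} \\
&\ll e^{-C_1 \sqrt{\log(n/T)}} \sum_{r < T} \frac{(1-\delta)^r}{r} + \sum_{r \ge T} \frac{(1-\delta)^r}{r} \\
&\ll e^{-C_1 \sqrt{\log(n/T)}} \log(\delta^{-1}) + \frac{(1-\delta)^T}{T \delta} \\
&\ll \log(\delta^{-1}) \left( e^{-C_1 \sqrt{\log(n/T)}} + \frac{(1-\delta)^T}{\log(\delta^{-1}) \sqrt{\log(\delta n)}} \right).
\end{aligned} \]

Taking into account the inequalities $(1-\delta)^{1/\delta} < e^{-1}$ and $(1-\delta)/\log(\delta^{-1}) < 1$ for $0 < \delta < 1$, and
the fact that $\frac{\delta n}{\sqrt{\log(\delta n)}} > \sqrt{\delta n}$ if $\delta n$ is large enough, we get

\[ \frac{1}{\log(\delta^{-1})} \sum_{r \ge 1} \varepsilon\!\left(\frac{n}{r}\right) \frac{(1-\delta)^r}{r} \ll e^{-C_2 \sqrt{\log(\delta n)}} \]

for some $C_2 > 0$. □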



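Theorem 1.2 follows from Corollary 2.1 on observing that, for $\delta = c n^{\theta-1}$,

\[ \frac{\log(\delta^{-1})}{1-\delta} = \left( (1-\theta) \log n - \log c \right) \left( 1 + O(n^{\theta-1}) \right) \]

when $0 < \theta < 1$.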

2.2. Variance. The second part of Theorem 1.2 is an immediate consequence of the following
proposition.

Proposition 2.2. In $S(n;\delta)$ we have $V(\psi(A)) \ll \delta n \log^2 n$.

Proof. By linearity of expectation we have that

\[ \begin{aligned}
V(\psi(A)) &= E(\psi^2(A)) - \left(E(\psi(A))\right)^2 \\
&= \sum_{m,l \le n} \Lambda(m) \Lambda(l) \left( E(I_A(m) I_A(l)) - E(I_A(m)) E(I_A(l)) \right).
\end{aligned} \]

We observe that if $\Lambda(m) \Lambda(l) \ne 0$ then $l \mid m$, $m \mid l$ or $(m,l) = 1$. Let us now study the term
$E(I_A(m) I_A(l))$ in these cases.
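(i) If $l \mid m$ then

\[ E(I_A(m) I_A(l)) = 1 - (1-\delta)^{\lfloor n/m \rfloor}. \]

(ii) If $(l, m) = 1$ then

\[ E(I_A(m) I_A(l)) = 1 - (1-\delta)^{\lfloor n/m \rfloor} - (1-\delta)^{\lfloor n/l \rfloor} + (1-\delta)^{\lfloor n/m \rfloor + \lfloor n/l \rfloor - \lfloor n/(ml) \rfloor}. \]

Both of these relations are subsumed in

\[ E(I_A(m) I_A(l)) = 1 - (1-\delta)^{\lfloor n/m \rfloor} - (1-\delta)^{\lfloor n/l \rfloor} + (1-\delta)^{\lfloor n/m \rfloor + \lfloor n/l \rfloor - \lfloor n(m,l)/(ml) \rfloor}. \]

Thus, on using (4), we obtain

\[ \begin{aligned}
&\Lambda(m) \Lambda(l) \left( E(I_A(m) I_A(l)) - E(I_A(m)) E(I_A(l)) \right) \\
&\qquad = \Lambda(m) \Lambda(l) (1-\delta)^{\lfloor n/m \rfloor + \lfloor n/l \rfloor - \lfloor n(m,l)/(ml) \rfloor} \left( 1 - (1-\delta)^{\lfloor n(m,l)/(ml) \rfloor} \right).
\end{aligned} \]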

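Finally on using the inequality $1 - (1-x)^r \le rx$ we have

\[ \begin{aligned}
V(\psi(A)) &\le 2 \delta n \sum_{1 \le l \le m \le n} \Lambda(l) \Lambda(m) \frac{(m,l)}{ml} (1-\delta)^{\lfloor n/m \rfloor + \lfloor n/l \rfloor - \lfloor n(m,l)/(ml) \rfloor} \\
&\le 2 \delta n \sum_{p \le n} \sum_{1 \le j \le k} \log^2 p\, \frac{p^j}{p^j p^k} (1-\delta)^{\lfloor n/p^j \rfloor + \lfloor n/p^k \rfloor - \lfloor n/p^k \rfloor} + 2 \delta n \sum_{\substack{l, m \le n \\ (l,m)=1}} \frac{\Lambda(l) \Lambda(m)}{lm} \\
&\le 2 \delta n \sum_{p \le n} \sum_{k \ge 1} \frac{k \log^2 p}{p^k} + 2 \delta n \left( \sum_{r \le n} \frac{\Lambda(r)}{r} \right)^2 \\
&\ll \delta n \log^2 n,
\end{aligned} \]

as we wanted to prove.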
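The covariance identity used in this proof gives an exact expression for the variance, namely $V(\psi(A)) = \sum_{m,l \le n} \Lambda(m)\Lambda(l)\,(1-\delta)^{x+y-z}(1-(1-\delta)^{z})$ with $x = \lfloor n/m \rfloor$, $y = \lfloor n/l \rfloor$ and $z = \lfloor n(m,l)/(ml) \rfloor$. The Python sketch below evaluates this double sum and checks it against a brute-force computation for a very small $n$; the function names and test parameters are illustrative assumptions.

```python
from itertools import combinations
from math import gcd, log

def prime_power_logs(n):
    """Return {m: Lambda(m)} for the prime powers m <= n (simple sieve)."""
    is_prime = [True] * (n + 1)
    table = {}
    for p in range(2, n + 1):
        if is_prime[p]:
            for q in range(p * p, n + 1, p):
                is_prime[q] = False
            m = p
            while m <= n:
                table[m] = log(p)
                m *= p
    return table

def variance_psi(n, delta):
    """V(psi(A)) in S(n; delta), summing the covariances of the indicators I_A(m)."""
    q = 1.0 - delta
    items = list(prime_power_logs(n).items())
    total = 0.0
    for m, lam_m in items:
        for l, lam_l in items:
            x, y = n // m, n // l
            z = n * gcd(m, l) // (m * l)   # number of common multiples of m and l up to n
            total += lam_m * lam_l * q ** (x + y - z) * (1.0 - q ** z)
    return total

def brute_force_variance(n, delta):
    """Variance of psi(A) over all subsets of {1, ..., n}, weighted as in S(n; delta)."""
    mean = second = 0.0
    for size in range(n + 1):
        weight = delta ** size * (1.0 - delta) ** (n - size)
        for A in combinations(range(1, n + 1), size):
            l = 1
            for a in A:
                l = l * a // gcd(l, a)
            value = log(l)
            mean += weight * value
            second += weight * value * value
    return second - mean * mean

print(variance_psi(8, 0.3), brute_force_variance(8, 0.3))
print(variance_psi(2000, 0.5), 0.5 * 2000 * log(2000) ** 2)
```

The second printed line places the exact variance next to $\delta n \log^2 n$; since the proposition only asserts an upper bound up to a constant, no particular ratio should be expected.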

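3. Proof of Theorem 1.1

Let us set $\overline{\psi}(n,k) = \binom{n}{k}^{-1} \sum_{|A|=k} \psi(A)$ and $\overline{\psi^2}(n,k) = \binom{n}{k}^{-1} \sum_{|A|=k} \psi^2(A)$.

Lemma 3.1. If $j < k$, then for $s = 1, 2$ we have that

\[ \overline{\psi^s}(n,j) \le \overline{\psi^s}(n,k) \le \overline{\psi^s}(n,j) + (k^s - j^s) \log^s n. \]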

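Proof. Suppose $j < k$. There are $\binom{n-j}{k-j}$ ways to add $k - j$ new elements to a set $A \in \binom{[n]}{j}$ in
order to obtain a set of $\binom{[n]}{k}$. Thus, for $s = 1, 2$ we have

\[ \psi^s(A) \le \binom{n-j}{k-j}^{-1} \sum_{\substack{|A'| = k-j \\ A \cap A' = \emptyset}} \psi^s(A \cup A'), \]

and then

\[ \begin{aligned}
\sum_{|A|=j} \psi^s(A) &\le \binom{n-j}{k-j}^{-1} \sum_{\substack{A \cap A' = \emptyset \\ |A|=j,\ |A'|=k-j}} \psi^s(A \cup A') \\
&= \binom{n-j}{k-j}^{-1} \sum_{|A''|=k}\ \sum_{\substack{A \cup A' = A'' \\ |A|=j,\ |A'|=k-j}} \psi^s(A'') \\
&= \binom{n-j}{k-j}^{-1} \sum_{|A''|=k} \psi^s(A'') \sum_{\substack{A \cup A' = A'' \\ |A|=j,\ |A'|=k-j}} 1 \\
&= \binom{n-j}{k-j}^{-1} \binom{k}{j} \sum_{|A''|=k} \psi^s(A''),
\end{aligned} \]

which proves the first inequality, since $\binom{n}{j} \binom{n-j}{k-j} = \binom{n}{k} \binom{k}{j}$.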

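For the second inequality we observe that for any set $A \in \binom{[n]}{k}$ and any partition in two
sets $A = A' \cup A''$ with $|A'| = j$, $|A''| = k - j$ we have that $\psi(A) \le \psi(A') + \psi(A'') \le
\psi(A') + (k-j) \log n$. Similarly,

\[ \begin{aligned}
\psi^2(A) &\le \left( \psi(A') + (k-j) \log n \right)^2 \\
&= \psi^2(A') + 2 \psi(A') (k-j) \log n + (k-j)^2 \log^2 n \\
&\le \psi^2(A') + 2j(k-j) \log^2 n + (k-j)^2 \log^2 n \\
&= \psi^2(A') + (k^2 - j^2) \log^2 n.
\end{aligned} \]

Thus, for $s = 1, 2$ we have

\[ \psi^s(A) \le \binom{k}{j}^{-1} \sum_{\substack{A' \subset A \\ |A'|=j}} \left( \psi^s(A') + (k^s - j^s) \log^s n \right) \le \binom{k}{j}^{-1} \sum_{\substack{A' \subset A \\ |A'|=j}} \psi^s(A') + (k^s - j^s) \log^s n. \]

Then,

\[ \begin{aligned}
\sum_{|A|=k} \psi^s(A) &\le \binom{k}{j}^{-1} \sum_{|A|=k}\ \sum_{\substack{A' \subset A \\ |A'|=j}} \psi^s(A') + \binom{n}{k} (k^s - j^s) \log^s n \\
&= \binom{k}{j}^{-1} \sum_{|A'|=j} \psi^s(A') \sum_{\substack{A' \subset A \\ |A|=k}} 1 + \binom{n}{k} (k^s - j^s) \log^s n \\
&= \binom{k}{j}^{-1} \binom{n-j}{k-j} \sum_{|A'|=j} \psi^s(A') + \binom{n}{k} (k^s - j^s) \log^s n \\
&= \binom{n}{k} \binom{n}{j}^{-1} \sum_{|A'|=j} \psi^s(A') + \binom{n}{k} (k^s - j^s) \log^s n,
\end{aligned} \]

and the second inequality holds. □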
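Both inequalities of Lemma 3.1 can be verified exhaustively for small $n$. The following Python sketch computes the means $\overline{\psi^s}(n,k)$ over all $k$-subsets of $\{1, \dots, n\}$ and checks the lemma for every pair $0 \le j < k \le n$ and $s = 1, 2$. The function names, the choice $n = 9$ and the floating-point tolerance are illustrative assumptions.

```python
from itertools import combinations
from math import gcd, log

def psi(A):
    """psi(A) = log lcm{a : a in A}; the empty set gives 0."""
    l = 1
    for a in A:
        l = l * a // gcd(l, a)
    return log(l)

def mean_power(n, k, s):
    """Mean of psi(A)^s over all k-element subsets A of {1, ..., n}."""
    values = [psi(A) ** s for A in combinations(range(1, n + 1), k)]
    return sum(values) / len(values)

def check_lemma(n, tol=1e-9):
    """Return True if both inequalities of Lemma 3.1 hold for all 0 <= j < k <= n, s = 1, 2."""
    for s in (1, 2):
        means = [mean_power(n, k, s) for k in range(n + 1)]
        for k in range(n + 1):
            for j in range(k):
                lower_ok = means[j] <= means[k] + tol
                upper_ok = means[k] <= means[j] + (k ** s - j ** s) * log(n) ** s + tol
                if not (lower_ok and upper_ok):
                    return False
    return True

print(check_lemma(9))
```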



THE LEAST COMMON MULTIPLE OF SETS OF POSITIVE INTEGERS 






Since $|A|$ is the sum of $n$ Boolean independent variables with expectation $E(|A|) = k$ we can
use Chernoff's inequality to get that, for each positive real value $r$,

\[ P(||A| - k| > r) \le 2 e^{-r^2/(3k)}. \tag{5} \]

We use this inequality in the proof of the next proposition. 

Proposition 3.1. For $s = 1, 2$ we have that

\[ \overline{\psi^s}(n,k) = E(\psi^s(A)) + O\!\left( k^{s-1/2} \log^{s+1/2} n \right) \]

where $E(\psi^s(A))$ is the expectation of $\psi^s(A)$ in $S(n; k/n)$.

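Proof. Observe that

\[ \begin{aligned}
E(\psi^s(A)) - \overline{\psi^s}(n,k) &= -\overline{\psi^s}(n,k) + \sum_{j=0}^{n} \left(\frac{k}{n}\right)^j \left(1 - \frac{k}{n}\right)^{n-j} \sum_{|A|=j} \psi^s(A) \\
&= \sum_{j=0}^{n} \left(\frac{k}{n}\right)^j \left(1 - \frac{k}{n}\right)^{n-j} \binom{n}{j} \left( \overline{\psi^s}(n,j) - \overline{\psi^s}(n,k) \right)
\end{aligned} \]

for $s = 1, 2$. Whenever $j$ and $k$ are close to each other, say $|j - k| \le r$ for a given $1 < r < k$,
then from the previous lemma we have

\[ \left| \overline{\psi^s}(n,j) - \overline{\psi^s}(n,k) \right| \le |k^s - j^s| \log^s n \le \begin{cases} r \log n, & s = 1, \\ 3kr \log^2 n, & s = 2, \end{cases} \]

which implies

\[ \sum_{|j-k| \le r} \left(\frac{k}{n}\right)^j \left(1 - \frac{k}{n}\right)^{n-j} \binom{n}{j} \left( \overline{\psi^s}(n,j) - \overline{\psi^s}(n,k) \right) \ll k^{s-1} r \log^s n. \tag{6} \]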

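On the other hand, if $|j - k| > r$, we use the trivial estimate

\[ \left| \overline{\psi^s}(n,j) - \overline{\psi^s}(n,k) \right| \le (k^s + j^s) \log^s n, \]

which gives

\[ \left| \sum_{|j-k|>r} \left(\frac{k}{n}\right)^j \left(1 - \frac{k}{n}\right)^{n-j} \binom{n}{j} \left( \overline{\psi^s}(n,j) - \overline{\psi^s}(n,k) \right) \right| \le \log^s n \sum_{|j-k|>r} \left(\frac{k}{n}\right)^j \left(1 - \frac{k}{n}\right)^{n-j} \binom{n}{j} (k^s + j^s). \tag{7} \]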

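We bound the summands separately:

\[ k^s \log^s n \sum_{|j-k|>r} \left(\frac{k}{n}\right)^j \left(1 - \frac{k}{n}\right)^{n-j} \binom{n}{j} = k^s \log^s n\, P(||A| - k| > r), \tag{8} \]

and

\[ \log^s n \sum_{|j-k|>r} \left(\frac{k}{n}\right)^j \left(1 - \frac{k}{n}\right)^{n-j} \binom{n}{j} j^s \le k^s \log^s n\, P(||A| - k| > r) + \log^s n \sum_{j > k+r} \left(\frac{k}{n}\right)^j \left(1 - \frac{k}{n}\right)^{n-j} \binom{n}{j} j^s. \tag{9} \]

We rewrite the last sum in (9) as (recall that $j \le n$)

\[ \sum_{j > k+r} \left(\frac{k}{n}\right)^j \left(1 - \frac{k}{n}\right)^{n-j} \binom{n}{j} j^s = k^s \sum_{j > k+r} \left(\frac{k}{n}\right)^{j-s} \left(1 - \frac{k}{n}\right)^{n-j} \binom{n-s}{j-s} \cdot \frac{j^s \binom{n}{j}}{n^s \binom{n-s}{j-s}} \]

and observe that if $r \ge 2$ then $j > k + 2$, so

\[ \frac{j^s \binom{n}{j}}{n^s \binom{n-s}{j-s}} \le 2, \]

thus

\[ \sum_{j > k+r} \left(\frac{k}{n}\right)^j \left(1 - \frac{k}{n}\right)^{n-j} \binom{n}{j} j^s \le 2 k^s P(||A| - k| > r - s). \tag{10} \]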

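Finally, we combine the estimates (6), (7), (8), (9) and (10) to get

\[ \begin{aligned}
\left| E(\psi^s(A)) - \overline{\psi^s}(n,k) \right| &\ll k^s \log^s n \left( \frac{r}{k} + P(||A| - k| > r - s) \right) \\
&\ll k^s \log^s n \left( \frac{r}{k} + e^{-(r-s)^2/(3k)} \right).
\end{aligned} \]

By taking $r \sim 10 \sqrt{k \log k}$ the result holds. □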

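To conclude the proof of Theorem 1.1 we combine Proposition 2.2 and Proposition 3.1 with
$k \sim c n^{\theta}$ to get

\[ \begin{aligned}
\binom{n}{k}^{-1} \sum_{|A|=k} \left( \psi(A) - \overline{\psi}(n,k) \right)^2 &= \overline{\psi^2}(n,k) - \overline{\psi}(n,k)^2 \\
&= V(\psi(A)) + \left( \overline{\psi^2}(n,k) - E(\psi^2(A)) \right) \\
&\quad + \left( E(\psi(A)) - \overline{\psi}(n,k) \right) \left( E(\psi(A)) + \overline{\psi}(n,k) \right) \\
&\ll n^{\theta} \log^2 n + n^{3\theta/2} \log^{5/2} n + \left( n^{\theta/2} \log^{3/2} n \right) \left( n^{\theta} \log n \right) \\
&\ll n^{3\theta/2} \log^{5/2} n.
\end{aligned} \]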
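Then Chebyshev's inequality implies that

\[ \begin{aligned}
\frac{\left| \left\{ A \in \binom{[n]}{k} : |\psi(A) - \overline{\psi}(n,k)| \ge \varepsilon\, \overline{\psi}(n,k) \right\} \right|}{\binom{n}{k}} &\le \frac{\sqrt{\overline{\psi^2}(n,k) - \overline{\psi}(n,k)^2}}{\varepsilon\, \overline{\psi}(n,k)} \\
&\ll \frac{n^{3\theta/4} \log^{5/4} n}{\varepsilon n^{\theta} \log n} = \frac{\log^{1/4} n}{\varepsilon n^{\theta/4}} \to 0,
\end{aligned} \]

which concludes the proof of Theorem 1.1.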



4. Extremal values 

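The first part of Theorem 1.3 follows from the next proposition on taking $\delta = c n^{\theta-1}$.

Proposition 4.1. Let $\delta = \delta(n)$ be a function such that $0 < \delta < 1$ and $\lim_{n \to \infty} \delta n = \infty$. Then

\[ \max_{\substack{A \subset \{1, \dots, n\} \\ |A| = \lfloor \delta n \rfloor}} \psi(A) \sim n \min(1, \delta \log n). \]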

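Proof. Let $A \subset \{1, \dots, n\}$ and $|A| = \lfloor \delta n \rfloor$. It is clear that

\[ \psi(A) \le \sum_{a \in A} \log a \le |A| \log n \le \delta n \log n. \]

On the other hand, it is clear that we always have $\psi(A) \le \psi(n) \sim n$. Thus

\[ \psi(A) \lesssim n \min(1, \delta \log n). \]

For the lower bound we distinguish two separate cases, denoting the $i$-th prime by $p_i$:

- If $\lfloor \delta n \rfloor \ge \left(1 - \frac{1}{\log n}\right) \pi(n)$, then we consider any set $A$ of $\lfloor \delta n \rfloor$ elements containing the
largest $\left(1 - \frac{1}{\log n}\right) \pi(n)$ primes in $\{1, \dots, n\}$ and get

\[ \psi(A) \ge \sum_{\substack{p \in A \\ p \text{ prime}}} \log p \ge \left(1 - \frac{1}{\log n}\right) \pi(n) \log p_{\pi(n)/\log n} = n(1 + o(1)). \]

- If $\lfloor \delta n \rfloor < \left(1 - \frac{1}{\log n}\right) \pi(n)$, then we consider a set $A$ of the $\lfloor \delta n \rfloor$ largest primes in $\{1, \dots, n\}$
and get

\[ \psi(A) = \sum_{p \in A} \log p \ge \lfloor \delta n \rfloor \log p_{\pi(n) - \lfloor \delta n \rfloor} \ge \lfloor \delta n \rfloor \log p_{\pi(n)/\log n} \ge \delta n \log n\, (1 + o(1)). \]

□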
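The construction in the second case is simple to reproduce. The Python sketch below takes $A$ to be the $k$ largest primes up to $n$, for which $\psi(A)$ is the sum of their logarithms, and prints the ratio $\psi(A)/(k \log n)$ for $k = \lfloor n^{\theta} \rfloor$. The function names and the values $n = 10^5$, $\theta \in \{0.3, 0.5, 0.7\}$ are illustrative assumptions.

```python
from math import log

def primes_up_to(n):
    """Primes in {2, ..., n} by the sieve of Eratosthenes (n >= 1 assumed)."""
    sieve = [True] * (n + 1)
    sieve[0:2] = [False, False]
    for p in range(2, int(n ** 0.5) + 1):
        if sieve[p]:
            for q in range(p * p, n + 1, p):
                sieve[q] = False
    return [p for p in range(2, n + 1) if sieve[p]]

def psi_largest_primes(n, k):
    """psi(A) for A = the k largest primes in {1, ..., n}; their lcm is their product."""
    primes = primes_up_to(n)
    if k > len(primes):
        raise ValueError("fewer than k primes up to n")
    return sum(log(p) for p in primes[len(primes) - k:])

n = 10**5
for theta in (0.3, 0.5, 0.7):
    k = int(n ** theta)
    # The trivial upper bound is k * log(n), so the ratio is at most 1.
    print(theta, k, psi_largest_primes(n, k) / (k * log(n)))
```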

To prove part ii) of Theorem 1.3 we need some notation and results concerning smooth
numbers. A number $n$ is a $y$-smooth number if all the prime factors of $n$ are $\le y$. It is usual to
denote by

\[ \Psi(x; y) = |\{n \le x : p \mid n \Rightarrow p \le y\}| \]

the number of $y$-smooth numbers not greater than $x$. Canfield, Erdős and Pomerance [2] proved
that for any $\varepsilon > 0$ we have

\[ \Psi(x; y) = \frac{x}{u^{u + o(u)}}, \]

where $u = \log x / \log y$ and $y \ge (\log x)^{1+\varepsilon}$. As a consequence of this result we have

\[ \Psi(x; \log^t x) = x^{1 - 1/t + o(1)} \tag{11} \]

for any $t > 1$.

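We prove that there exists a set $A$ such that $|A| = \lfloor c n^{\theta} \rfloor$ and $\psi(A) \le (\log n)^{2 + \theta/(1-\theta) + o(1)}$.
Let $t$ be a real number satisfying $\Psi(n; \log^t n) = \lfloor c n^{\theta} \rfloor$. By (11) we know that $t = \frac{1}{1-\theta} + o(1)$.
For this $t$ we consider the set $A$ of $\log^t n$-smooth integers $\le n$, namely

\[ A = \{m \le n : p \mid m \Rightarrow p \le \log^t n\}. \]

By construction we have that $|A| = \lfloor c n^{\theta} \rfloor$. It is clear that

\[ \operatorname{lcm}\{a \in A\} = \prod_{p \le \log^t n} p^{\lfloor \log n / \log p \rfloor}. \]

Thus

\[ \psi(A) = \sum_{p \le \log^t n} \log p \left\lfloor \frac{\log n}{\log p} \right\rfloor \le \sum_{p \le \log^t n} \log n \le (\log n)^{2 + \frac{\theta}{1-\theta} + o(1)}, \]

as was to be shown.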
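The formula for the least common multiple of the smooth numbers can be checked on a small instance. In the Python sketch below, the threshold $y$ is taken to be an integer with $2 \le y \le n$ instead of $\log^t n$, and the function names and the example $n = 100$, $y = 5$ are illustrative assumptions. The largest power of $p$ not exceeding $n$ is found by repeated multiplication, which avoids floating-point issues in $\lfloor \log n / \log p \rfloor$.

```python
from math import gcd, log

def primes_up_to(y):
    """Primes in {2, ..., y} by the sieve of Eratosthenes (y >= 2 assumed)."""
    sieve = [True] * (y + 1)
    sieve[0:2] = [False, False]
    for p in range(2, int(y ** 0.5) + 1):
        if sieve[p]:
            for q in range(p * p, y + 1, p):
                sieve[q] = False
    return [p for p in range(2, y + 1) if sieve[p]]

def smooth_numbers(n, y):
    """All y-smooth integers in {1, ..., n}."""
    small_primes = primes_up_to(y)
    result = []
    for m in range(1, n + 1):
        r = m
        for p in small_primes:
            while r % p == 0:
                r //= p
        if r == 1:          # every prime factor of m was <= y
            result.append(m)
    return result

def lcm_of(A):
    """Least common multiple of the integers in A."""
    l = 1
    for a in A:
        l = l * a // gcd(l, a)
    return l

def lcm_from_formula(n, y):
    """Product over primes p <= y of the largest power of p not exceeding n (y <= n assumed)."""
    l = 1
    for p in primes_up_to(y):
        power = p
        while power * p <= n:
            power *= p
        l *= power
    return l

A = smooth_numbers(100, 5)
print(len(A), lcm_of(A), lcm_from_formula(100, 5), log(lcm_of(A)))
```

The set is large (34 of the first 100 integers) while its least common multiple is only $2^6 \cdot 3^4 \cdot 5^2$, which is the mechanism behind the small value of $\psi(A)$ in part ii).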